A factor of i.i.d with uniform marginals 
and infinite clusters spanned by equal labels 

by Peter Mester 



Abstract. We give an example of a factor of i.i.d. labeling of the 3-regular 
tree with the property that the marginal labels are uniform and the equivalence 
classes of equal labels contain infinite clusters. 



§1. Intoduction and conventions. 

In his ICM survey [2], in slightly different language Gaboriau asks the following as 
Question 5.6: 

Question 1.1. Assume that is a [0, l]-labeling of the vertices of some Cayley graph 
which arises as a factor of i.i.d. uniform labeling of the vertices and assume that the 
marginals of on the vertices are themselves uniform. Does it follow that almost surely 
every cluster spanned by vertices whose labels are the same is finite? 

In this paper we show by example that the answer is no. In fact in our example there 
will be only infinite components. Note however, that the answer is known to be yes if we 
replace "finite" with "hyperfmite" (meaning that the corresponding equivalence relation 
can be obtained as an increasing union of equivalence relations with finite partition classes 
only) . The latter is a result of Chifan and Ioana [1] . The Cayley graph we use will be the 
3-regular tree T%. The result can be extended to any non-amenable Cayley graph using a 
theorem of Gaboriau and Lyons [3]. 

By a uniform random variable we will always mean one which is uniform on [0,1]. 
Let Gbea countable group. A function / : [0, 1] G — > [0, 1] G is called equivariant if for all 
oo £ [0, 1] G and 7 G G we have jf(oj) = /(7a;). 

Then a factor of i.i.d. uniform on G is a [0, 1] G - valued random object which can be 
obtained the following way: First, an equivariant measurable function / : [0, 1] G — > [0, 1] G 
is given. Using / we define the random object as follows: take a family of i.i.d. uniform 
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random variables {U v } ve o and then apply /. The random elements of [0, 1] G arising in this 
way are called factors of i.i.d. uniform. In this paper we will call them simply "factors" . 

A factor is an invariant process; by an invariant process on a G we mean [0,l] G -valued 
random object 1Z whose distribution is invariant under group multiplications. The precise 
meaning of this is that if x±, . . . , x n , 76 G, then the distribution of (7^(7x1), . . . , 1Z(^x n )) 
does not depend on 7. Certain claims we need about our construction will already follow 
from basic principles about invariant processes. 

Now we explain some conventions we use. 

In explaining our constructions it will be convenient to use other label sets as well, 
meaning that we will use S G valued random elements as well where S will be different 
from [0, 1]. In that case we will often call these objects S-labelings. In our case we will 
encounter {0, 1} and {0, l} n -labelings. 

When a {0, l}-labeling C has the property that C has the same distribution as (1 — £), 
then we call it symmetric. 

Note that if we have a vertex labeling of a graph with i.i.d. uniform labels, then it can 
be used to obtain a further labeling so that over each vertex we have an infinite sequence 
of i.i.d. uniform labels. This is so because the binary digits of a uniform random variable 
are independent and so can be "reorganized" to determine infinitely many elements from 
[0, 1]. We make explicit in the final section how we actually use this reorganization. 

If a G [0, 1] let biti(a) denote the ith binary digit of a. To avoid ambiguity let us use 
the binary representation which contains infinitely many l's. 

If T is a tree and £ is a labeling of V(T), then clust(£) will be the forest obtained by 
deleting edges between vertices of different £-label. We will call these trees as ^-clusters. 
In general when we talk about clusters in a context where a labeling is understood we 
mean a connected component spanned by vertices whose label is the same. 

Call a v G V(T) a furcation if after deleting v from T among the remaining components 
there are at least 3 infinite ones. If a tree T has a furcation we will say that T is forking. 

If T is a graph by a Bernoulli(|) percolation on T we mean a {0, l}-labeling where 
the individual labels are mutually independent. The basic theory of Galton- Watson trees 
implies the following well-known fact: If T any tree whose minimal degree is at least 4, 
then a Bernoulli(^) percolation on T almost surely results in infinite clusters both of label 
and 1. 

Moreover among these infinite clusters there will be some forking. To see this note first 
that by the basic theory of Galton- Watson processes this happens with positive probability. 
This is also true if we consider trees which have one vertex of degree 2 and all others of 
degree 4. So if we delete the edges of a bi-infinite path, then the Bernoulli (|) process 
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restricted to the remaining components will result in forking clusters with probability 
c > within each of these remaining components independently. This implies that one of 
these events almost surely will happen. 

In fact almost surely all of the infinite clusters are forking but for our purpose the 
mere existence is enough. 

We will often use partitions of V(T) for a tree T and we will call the partition classes 
cells (or Il-cells). Note also that the partitions we encounter will span a connected subgraph 
of T and whenever we mention partitions we mean a partition with that property. If x is 
a vertex of T let Ii{x) denote the Il-cell containing x. If IT is such a partition, then by 
T n we mean the following tree: Its vertices are the cells of IT and two cells are connected 
by an edge iff their distance is 1 in T. This is indeed a tree by the connectedness of the 
cells. If we have two partitions IIi,n2 on the same set, then let IIi -< II2 mean that IIi is 
a refinement of II2. If J 7 is a spanning forest (meaning that every vertex is included), then 
we will call the J 7 and IT compatible if for every component T of J 7 the vertex set V(T) is 
a union of Il-cells. 



§2. Outline of the construction. 

Here is a brief outline how the construction works. The labeling which witnesses 
the negative answer to Question [O] will be denoted as A. We will construct A in steps 
indexed by the positive integers. In particular in the ith step we will construct the ith 
binary digit of the labels. Importantly for a fixed vertex o the ith bit of A(o) will be 
fair and independent of the previous bits. This already implies that the marginals are 
uniform. The fairness of the ith bits will follow from the fact that the ith bit itself will be 
a symmetric labeling. 

In the process we will have a sequence of pairs (J-^, IIi) where Ti is a spanning forest 
of T3 and IIi is a partition of V(T). These forest will consist of infinite trees only and they 
will be "big" in that sense that if T is a tree of then T Ui will have minimal degree at 
least 4. 

These pairs will be related in such a way that IIi -< ^-i+i will always hold and a 
component T + of J~i+i will always be a subtree of some component T of Ti. Moreover if T 
is a component of Ti, then the first i bits constructed up to the ith step will be the same 
for any two vertices of T. Observe that the above already implies the claim: if we have a 
vertex x, then the increasing union of Ili(x), . . . , U n (x), . . . will be an infinite component 
over which all the bits will be the same. 

Now we say somewhat more about how the labeling will work. 
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If we label the II-cells with independent fair bits than it can be considered as Bernoulli(^) 
percolation on all the different T Ui 's. But our condition on (J^, Ilj) implies that all of these 
trees have minimal degree at least 4. So when these trees split into the appropriate clusters 
there will be some forking clusters in each tree. To finish the process we "clean up" the 
remaining non-forking clusters by changing some of the labels so that each merges into one 
of the forking clusters. So when a new bit of A is revealed the forest we had so far will split 
into smaller trees. Before turning to the new bits we enlarge the partition cells so that the 
"bigness" condition still holds and we continue. It will be clear that the marginals of the 
ith bit will be fair bits. 



§3. Merging small clusters into big ones. 

Assume that we classify trees as "small" and "big" and our classification scheme has 
the natural property that being big is upward closed in the sense that if T is a tree which 
contains a big subtree, then T is itself big. Two natural examples are: "being infinite" 
and "being forking" . In this section we show that if we have a {0, l}-labeling which almost 
surely has big clusters, then there is a natural way to "merge" the small clusters into the 
big ones so that at the end only big ones remain. Practically it will be achieved through a 
"relabeling" which replaces the old label £ by a new one C* so that all the clusters of the 
C* label are big. 

Note that this merging process will not need any extra randomness and also has the 
following feature: If the original label C was random and symmetric, then C* will also be 
symmetric. 

We assume that a spanning forest J 7 is given with some {0, l}-labeling C. Within each 
tree we apply the process we are going to describe. So T below is going to be a component 
of J 7 and we assume that the C labeling restricted to it has at least one big component. 

For the sake of being specific we will talk about non-forking and forking trees but it 
will be clear that the following works for any other distinction of being small and big. 

We start with a {0, l}-labeling C on some tree T and importantly we assume that 
£ has forking clusters. Then this relabeling results in a new {0, l}-labeling r + (£) which 
will have only forking clusters. Importantly this is realized as an equivariant deterministic 
function of C. This means that if C itself was a factor of i.i.d. , then r + (£) will be one 
as well (measur ability will be obvious). The idea of this relabeling is very simple: if a 
non-forking £-cluster C is at distance 1 from a forking one, then every vertex in C switches 
its label so that C "joins" the forking cluster. In this way the forking clusters have grown 
and we iterate the process. 
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If we want to emphasize the dependence on J 7 we will use the notation r; 



§4. Some operations for partitions. 

Here we collect some operations related to partitions. 

If we have a partition S on V(T n ), then there is a natural corresponding partition 
down(S) on V(T): Take a S-cell C which is a subset of V{T n ), then taking [jC gives a 
subset of V(T) and these subsets form a partition we denote by down(S). 

The next operation needs a tree T, a distinguished vertex set S C V(T), and a family 
of distinct real numbers a(x) x€ y(T 3 ) as input. We associate to them a partition which 
is very similar to the so called Voronoi tiling. Because of this similarity we denote this 
partition by Vor(<S', a). The en's are needed to take care of a potential ambiguity. 

If v G V(T), then let S v C S be the set of those elements of S which are closest to 
v, i.e., S v := {s; distr(v, s) = distr(t> , £)}• If S v was a singleton we just declare v to be 
equivalent to its unique member. However, in general S v may have more than one element 
and for a consistent choice we rely on the a values. Let (/>(s, a )( v ) '■= s o De that element of S v 
for which a(so) is minimal. Let two vertices v\,V2 be equivalent if (f>(s,a)( v i) = 0(5,a)( v 2) 
and let Vor(5', a) be the partition corresponding to this equivalence. It is easy to check that 
in this way the partition classes are connected. Note that in this way each \/or(S, a)-cell 
contains exactly one element from S (this will be relevant in the next section). 

For the next two operations we have a family of pairs of real numbers 

{(3(x)} xeV(T3) := {(Pdx), p 2 (x))} x€V{Ts) . 

We define a "lifting" liftn(/3i, P2) of them which will be defined over the cells of n: 
If c is a n-cell let v e c be that vertex for which f3i(v) = min{/3i(u>); w G c}. Then let 
liftn(/3i,/3 2 )(c) :=fc(v). 

Define a {0, l}-labeling of the cells of n as bit n (/5) := bit x (lift n (A , fc)). 

The remaining two operations need a spanning forest T of T3, an ^-compatible par- 
tition n on V(Ts), and some families of real numbers associated to the vertices of T 3 . The 
output of these operations will be some new partition or some labeling associated to the 
cells of the partition. In order to be well-defined we will need some extra assumptions 
about the possible inputs. The following assumptions are enough for the operations and 
we assume them throughout: 

Each component of J 7 is forking, each cell of n is finite, n is ^-compatible, and 
the families of real numbers consist of distinct elements. The operations will be defined 
component-wise so let T be any component of T . 
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We again need a family of pairs of real numbers 

{P{x)}x€V(T 3 ) ■= {(Pl(x),p2(x))}x€V(T 3 )- 

We associate to the triple (J 7 , IT, (3) a new partition Il + = Fur(J r , IT, (3) so that IT -< 11+ and 
every cell of Il + contains a finite but non-zero number of furcations of T. Consider the tree 
T n and let S C V(T U ) be its set of furcations. Let /3+ := lift n (/3i, (3 2 ). Take the partition 
Vor(S', f3 + ) in T n and finally use the down operation to obtain Il + := Fur(J r , IT,/?) := 
down(Vor(S,/3+)) . 

For the last operation we need a notation: In the graphs considered as natural metric 
spaces the ball of radius n around a vertex v will be denoted as B n (v). If we need to 
indicate the graph G, then we denote it by B n (v, G). 

For the next operation we use a quadruple of 4-families of real numbers 

{l(x)}x€V(T 3 ) ■= {(ll(x), 72(a), 73(x), M x ))}x€V(T 3 )- 

We associate to this data a new partition IT* = Sep(J r , 11,7). Distinguish a subset R C 
V(T n ) as follows. Let 7* := lift n (7i, 7 2 )- Let r G F(T n ) be in i? iff 7 *(c) is minimal 
among the 7*(i>)'s for v G B 3 (r, T n ). Note that i? is separated in the sense that it cannot 
contain any two IT cells whose distance is at most 2 in T n . To obtain IT* we consider 
the tree T n and define 7° := liftn(7 3 , 7 4 ), then take Vor(i?, 7°). This is a partition on 
V(T n ) and we apply the down operation to push it back to be a partition on V{T), i.e., 
IT* := Sep(J*,n, 7 ) := down(Vor(#, 7 D )). 

Note that Ii{x) C II*(a;) for any vertex x (the emphasis is on the fact that U(x) 7^ 
II*(a;)). This is because if c is a Il-cell which is also in i?, then for each neighbor c\ of c in 
T n we have c\ G Vor(i?, 7 D )(c) (because c is the unique closest element of R to c\ by the 
separation property). Note also, that if (J 7 , IT) had the property that for each component 
T of J 7 the minimal degree in T n is at least 3, then the minimal degree in T n is at least 
4. 

When the above operations are used on data which are random and invariant, then 
the result almost surely will satisfy some extra properties which we are going to explore 
in the next section. 
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§5. The Mass- Transport Principle. 

This section owes a lot to the exposition in [4]. Here we recall the Mass- Transport 
Principle which is a useful tool in studying invariant processes on a group G. Since we are 
interested in these processes mostly how they behave on a Cayley graph of G we will refer 
to the elements of G as vertices. 

Assume that 1Z is a probability measure on O := [0, 1] . Let F:GxGxO->[0,oo] 
be a diagonally invariant measurable function (meaning that F(x,y,u) = F(jx,jy,ju) 
for all 7 G G). The quantity F(x,y,u) is often called the mass sent by x to y or the mass 
received by y from x. The Mass- Transport Principle says that if 1Z is invariant, then for 
the identity o eV the expected overall mass o receives is the same as the expected overall 
mass it sends out. Now we formalize and prove this: 

Theorem 5.1. IflZ and F are as above, 1Z is invariant, f(x,y) := EfcF(x,y,*), then 

xev xev 

To prove it, first observe that the invariance of 1Z implies that / is also diagonally 
invariant. This implies that f(o,x) = f(x~ 1 o,x~ 1 x) = f{x~ l ,o) and this finishes the 
proof since inversion is a bijection. 

A simple application of this is the following. Assume that some invariant process 
determines some partitioning of the vertices (where we do not know a priori that the 
partition classes are finite) and that within each partition class there is finite nonempty 
subset of distinguished vertices (still determined by the invariant process). Then we can 
conclude that the partition classes are almost surely finite. If not, then first we can select 
uniformly a single distinguished vertex and then the following event would have positive 
probability: "the origin is the distinguished vertex of some infinite partition class" . This 
means that the following mass transport would contradict the mass transport principle: 
F(x,y,u) := 1 if x is the distinguished vertex of the partition class containing y in the 
random configuration u. Then the expected mass the origin would receive would be no 
more than one (this is true even pointwise). However, the expected mass it would send 
out is infinite (it even would send out infinite mass with positive probability). 

Observe that this means that for the operations Fur, Sep if their input data was de- 
termined by an invariant random process (assume that under this process the input data 
almost surely satisfied the requirements we made for our operations to be well-defined), 
then applying them almost surely result in partitions which are finite. This is because the 
new partitions contained a finite nonempty subset of distinguished vertices. 
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Note a particular consequence of this: If J 7 is a random invariant spanning forest of 
T3 whose components are almost surely forking, IT is an invariant J-"-compatible random 
partition almost surely with finite cells, and {W(x)} xe v(T 3 ) = {U(x), V(x)} xe v(T 3 ) is 
an i.i.d family of pairs of i.i.d. uniform random variables, then IT" 1 " := Fur(J-", IT, W) is a 
partition with almost surely finite cells. This implies that for any component T of T the 
tree T n+ almost surely has minimal degree at least 3. This is because each finite cell C 
of IT" 1 " contained a furcation so if we delete the vertices of C there are at least 3 infinite 
components remaining (since we have just deleted finitely many vertices). Moreover, those 
infinite components are partitioned into cells of finite size under n + so they "stay" infinite 
in T n+ as well. So T n+ is a tree each of whose vertices is a furcation which implies that 
each vertex has degree at least 3. 



§6. The actual factor. 

We now turn to the description of our factor label A witnessing the negative answer 



to Question 1.1 



First we use the fact that a single uniform random variable determines countable many 
i.i.d. uniform ones. Organize them to obtain three independent sequences: 

Uq , . . . , U n , . . . , 

Vi, . . . , V m , . . . , 
Bi, . . • , Bfc, . . . , 

where U n = (Ui, U2, Us, U^) n is a quadruple whose components are i.i.d. uniform labelings 
of V(Ts), V m = (Vi, V2)m is a pair whose components are i.i.d. uniform labelings of V{Ts), 
and Bfc = {B\,B2)k is a pair whose components are i.i.d. uniform labelings of V(Ts). 

In order to make our process well-defined for all initial labelings of [0, l] y ( T3 ) (not 
just almost surely) we make the following comments. In what follows we often state and 
use that certain events hold almost surely. If any of these events do not hold, then the 
final label A is set to be identically (regardless of how "far" the process below has been 
defined). Note that with this convention we always get infinite clusters spanned by vertices 
of the same labels (and not just almost surely). 

The construction will work in steps indexed by the positive integers. The nth step 
reveals the nth bit of A. 

At the end of step n we will have a forest T n and a partition II n . The components of 
T n will exactly be the clusters corresponding to the first n bits of A, the partition II n will 
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be compatible with T n and II n -< II n+ i will also hold. Importantly, if T is a component 
of T n , then almost surely T n ™ will have minimal degree at least 4. 

Start the first step by setting Tq to be the forest consisting of the single tree T 3 and 
let U-i be the partition where every cell is a singleton. Let n := Sep(J r > n_i, Uq). 

Note that then almost surely has minimal degree at least 4 and now we have 
defined the pair (J^jIIq). 

Label the cells of n by i.i.d. fair bits Af := bit n (Bi) (this means that every vertex in 
a given cell gets the same label but for vertices in different cells these labels are mutually 
independent). Here g stands for "guess" as this is our first guess what Ai should be. 

This is a Bernoulli(^) percolation on . Since the latter is a tree whose minimal 
degree is at least 4 the forest clust(Af) of percolation clusters almost surely will contain 
forking components. Because of that the operation (Af ) is well defined and let us define 
Ai := r?(Af). 

Note that Ai is constant over each cell of n . So if T\ = clust(Ai), then J 7 ! is 
n -compatible. 

Note also the important fact that if T is any Ai-cluster, then it is forking. Then we 
can define IT J as IT + := Fur(J r i, n , Vi). Then, by the argument at the end of the last 
section, for every component T of T\ the tree T n o has minimal degree at least 3. 

Then to conclude this step we define LTi to be LTi := Sep(.Fo, nj, U\). Almost surely 
for any component T of T\ the tree T Ul has minimal degree at least 4. 

We have just described how to obtain (J^ni) from (J^qjTIq). Now we turn to the 
general step. 

In general what happens when we iterate our process is this: We have a forest T n and 
a partition II n so that they are compatible and almost surely for any tree T of T n the tree 
T n ™ has minimal degree at least 4. 

At the same time we have also defined the first n bits Ai, . . . , A n so we naturally have a 
labeling (A 1; . . . , A n ) of V(T 3 ) by elements of {0, l} n and it also holds that clust((Ai, . . . , A n )) = 

T 

•J n- 

To proceed to the next step label the cells of II n by i.i.d. fair bits A^ +1 := bit nn (B n+1 ). 

As before, this imitates a Bernoulli (|) percolation over trees (i.e., take a tree T from 
T n and consider T n ") which have minimal degree at least 4 so almost surely there will be 
some forking A^ +1 -cluster within each tree of T n . 

Then let A n+1 := rf"(A£ +1 ). 

To continue the iteration step let JSi+i := clust((Ai, . . . , A n+ i)). Note that indeed 
J-'n+i is Il n -compatible and every component T of J-'n+i is forking so we can define 11+ := 
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Fur(J r n+1 , V n+ i). As above, for any component T of F n+ \ the tree T n » has minimal 
degree at least 3. To complete the iteration step let II n+1 := Sep(J r n+ i, 11+ , U n+ i). 

This way we indeed obtain our sequence Ai, . . . , A n , ... as promised. 

Finally A let be the labeling for which bitj(A) = A*. We claim that every vertex v is 
in an infinite A-cluster. The reason for this is the fact that Ii n -< Ii n +i so for a vertex v 
we have H n (v) C H n+ i(v) and note that the inclusion is proper and these cells are always 
connected so their increasing union is an infinite connected component C(v) containing v. 
From the construction it is also clear that A(x) = A(v) for any vertex x of C(v). I 
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